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Pol III Promoters to Express Small RNAs: Delineation of 
Transcription Initiation 

Hongming Ma^YongganWu^Ying Dang^ Jang-Gi Choi^ Junii Zhang^ and HaoquanWu^ 

Pol III promoters such as U6 are commonly used to express small RNAs, including small interfering RNA, short hairpin RNA, 
and guide RNA, for the clustered regularly interspaced short palindromic repeats genome-editing system. However, whether the 
small RNAs were precisely expressed as desired has not been studied. Here, using deep sequencing to analyze small RNAs, we 
show that, for mouse U6 promoter, sequences immediately upstream of the putative initiation site, which is often modified to 
accommodate the restriction enzyme sites that enable easy cloning of small RNAs, are critical for precise transcription initiation. 
When the promoter is kept unmodified, transcription starts precisely from the first available A or G within the range of positions 
-1 to +2. In addition, we show that transcription from another commonly used pol III promoter, H1, starts at multiple sites, which 
results in variability at the 5' end of the transcripts. Thus, inaccuracy of 5' end of small RNA transcripts might be a common 
problem when using these promoters to express small RNAs based on currently believed concepts. Our study provides general 
guidelines for minimizing the variability of initiation, thereby enabling more accurate expression of small RNAs. 
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Introduction 

Because of the ease of silencing target genes, snnall interfer- 
ing RNA (siRNA)/short hairpin RNA (shRNA) is currently one 
of the nnost connnnonly used tools for bionnedical research, 
and particular siRNAs/shRNAs are also being explored as 
therapeutic candidates in a nunnber of diseases. To provide 
long-ternn suppression of target genes, siRNA/shRNAs are 
connnnonly expressed with pol III pronnoters partly because of 
their defined sites for transcription initiation and ternnination J-^ 
While the two siRNA strands can be transcribed separately 
to generate the siRNA duplex, ^"^'^ shRNAs are transcribed as 
a single transcript with a hairpin structure, ^■^■^■^■^ which is pro- 
cessed by Dicer into siRNA duplexes. 

Clustered regularly interspaced palindronnic repeats 
(CRISPR)-Cas9 systenn is a natural nnechanisnn in bacteria 
and archaea to defend against foreign DNA.^°'^^ The type II 
prokaryotic CRISPR-Cas9 endonuclease systenn has been 
recently applied as a new strategy for targeted genonne engi- 
neering in which a short chinneric guide RNA (gRNA) guides 
Cas9 nuclease to edit genonnic target sites.^^"^^ Connpared 
with other genonne-editing technologies, such as zinc finger 
nuclease and transcription activator-like effector nuclease, 
this new technology can be used to edit any genonne tar- 
get site with ease. The Cas9 enzynne being invariant, the 
only requirennent for editing any chosen genonnic site is the 
expression of specific snnall guide RNAs.^^-^^ 

Expressing snnall RNAs with desired sequence is a criti- 
cal step for both siRNA/shRNA and CRISPR-Cas9 tech- 
nologies. For siRNA, expressing two snnall RNAs with the 
exact designed sequences is critical to ensure loading of 
the desired strand into the RNA-induced silencing connplex 
and to nnaintain specificity for the target gene. For shRNA, 



the 5' end of shRNA transcripts is also critical, because the 
transcripts fornn a stenn loop structure that is processed by 
Dicer, which nneasures 21 to 22 nt fronn the 5' end of the tran- 
script and perfornns a cleavage to generate the nnature siRNA 
duplex.2^'22 Even one nucleotide difference at the 5' end of the 
transcript will generate a different siRNA duplex. For gRNAs 
of the CRISPR systenn, the first ~20-nt sequence of the guide 
RNA transcript defines the CRISPR target.^°'^3-i6,i8,i9 jh^s, 
precision at the 5' end of these snnall RNA transcripts is criti- 
cal for these technologies to work properly. 

RNA pol III pronnoters, such as U6 and HI , are connnnonly 
used to express these snnall RNAs. It is believed that nnouse 
U6 pronnoter transcription starts at the +1 position (23 nt 
after the TATA box), with G as the preferred initiation nucleo- 
tide.2^-2^ However, the exact U6 transcription initiation site 
has not been rigorously studied. Here, we show that the ini- 
tiation of snnall RNAs driven by the nnouse U6 pronnoter is 
often not at the presunned initiation site and is affected by the 
surrounding sequence. In addition, we found that the tran- 
scription initiation site of another connnnonly used pronnoter, 
HI , is generally variable. Thus, variability in the initiation site, 
which results in innprecision at the 5' end of snnall RNA tran- 
scripts, nnight be a connnnon problenn. Our study provides 
general guidelines for using the pol III pronnoters to express 
snnall RNAs accurately. 

Results 

U6 transcription initiation is not always at the +1 position 

sh1005 is an shRNA that efficiently targets CCR5 expres- 
5jQi^26-32 i^gg approved for a clinical trial ainning to 
treat HIV. However, U6-driven shRNAs showed high toxic- 
ity when used in T cells.^^'^^ Because nniRNA-based shRNA 
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mitigates the toxicity of conventional shRNA and also might 
have better functionality,^^'^^ we converted its presumed 
mature sequence (GGUGUAAACUGAGCUUGCUCUUP^o 



into miRNA-based shRNAs using pri-miR-30 or pri-miR-150 
as backbone (see Supplementary Figure S1a). Surpris- 
ingly, both miRNA-based shRNAs showed significantly 
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decreased functionality compared with the original sh1005 
(see Supplementary Figure S1b). To understand how the 
original sh1005 and sh1005 in the miRNA backbone are 
processed by the endogenous miRNA machinery and why 
miRNA-based shRNAs are less efficient, we transfected the 
constructs into 293FT cells and sequenced the small RNAs. 
The miRNA-based shRNAs were processed precisely as pre- 
dicted to generate the presumed mature sh1005 sequence 
(NCCR5; see Supplementary Figure S1c). To our surprise, 
the predominant mature product generated from sh1005 was 
UGUAAACUGAGCUUGCUCUUU (termed WCCR5), not its 
presumed mature sequence GGUGUAAAGUGAGCUUGCU- 
CUU (termed NCCR5; Figure 1a and Supplementary Table 
S1 and S2). To test whether this is the cause of the compro- 
mised functionality, we constructed two new miRNA-based 
shRNAs to express the WCCR5 sequence (see Supplemen- 
tary Figure S1d). The processing of these miRNA-based 
shRNAs was confirmed by small RNA sequencing, and the 
results showed that they were processed exactly as predicted 
(see Supplementary Figure S1e). The functionality of these 
two new miRNA-based shRNAs (WCCR5) was higher than 
that of the previous miRNA-based shRNAs (NCCR5) and 
slightly higher than that of sh1 005 (Figure 1 b). The function- 
ality of shRNAs was further confirmed by luciferase reporter 
assay (Figure 1c). These data show that the presumptive 
mature siRNA sequence according to the current concep- 
tion can be significantly in error, and knowing the exact small 
RNA products generated from shRNAs is critical for design- 
ing optimal miRNA-based shRNAs. 

It has been shown that, unlike the 3' end of mature prod- 
ucts of shRNA that is subject to extensive modification after 
its biogenesis, the 5' end of mature products is generally 
protected from modification^^-^^ and therefore, the 5' end of 
mature products generated from the 5' arm of shRNA should 
faithfully indicate the original transcription initiation site. It is 
interesting that, judging from the predominant mature prod- 
ucts generated from the 5' arm of sh1 005, the initiation site of 
transcription for sh1005 is at the -1 (A) position, not the gen- 
erally accepted +1 (G) position for the mouse U6 promoter 
(Figure 1a,d).The single-nucleotide shift of the sh1005 tran- 
scription initiation site extended the stem by 1 bp (Figure la, 
lower panel). Since Dicer measures 21-22 nt from the 5' end 
of the transcript and performs a cleavage to generate the 
mature siRNA duplex,^^'^^ this extension may have caused a 
shift in the Dicer cleavage site and thus may explain why the 
mature siRNA sequence is not the presumed one. However, 
it appears that variability of transcription initiation might be a 



common problem in using the U6 promoter to express small 
RNAs. A recent report in which the U6 promoter was used 
to express miR-30 also showed that the initiation site is not 
precise and that the actual dominant mature product gener- 
ated is not miR-30 (69% of the reads were 3 nt shorter at 
the 5' end)."^^ The precision of the initiation site is critical for 
expressing shRNA or miRNA mimics to generate the desired 
mature sequences. Thus, we wanted to determine the exact 
initiation site of the U6 promoter. We first randomly selected 
seven shRNAs available in the laboratory, transfected them 
into 293FT, and sequenced the small RNAs. The results 
showed that the initiation site was not always at the +1 (G) 
position and that it followed a pattern: if the -1 position is 
an A, the initiation site is at -1 , whereas if the -1 position 
is a T, the initiation site is at +1 (G, Figure 1e). This pattern 
is reminiscent of the native mouse U6 sequence, which has 
the sequence GTTT upstream of the +1 (G) position (Figure 
Id). Thus, it appears that if the -1 position happens to be a 
T, as in the native U6 promoter, the initiation site will be at +1 
(G). If the -1 position is changed to A, the initiation site will 
be changed to the -1 position, suggesting that the nucleo- 
tides upstream of the initiation sites can affect transcription 
initiation. 

Transcription initiation is affected by the sequence 
around the initiation site 

To understand how the surrounding sequence affects U6 
promoter initiation site selection, we systematically inves- 
tigated the effect of nucleotide alterations around the +1 
position on the transcription initiation of sh1005 by inserting 
different nucleotides upstream, as shown in Figure 2a. The 
constructs were transfected into 293FT cells separately, and 
small RNA products were sequenced to determine the ini- 
tiation site. Transcription initiation site selection appeared to 
follow a simple rule: transcription starts from the first A or 
G beginning at the -1 position (Figure 2a). It appears that 
transcription can also start from the first C, but with much 
less efficiency (Table 1). According to read abundances, 
transcription efficiency appears to decrease if A/G occurs at, 
or downstream of, the +3 position (Table 1). To confirm the 
effect of a change in initiation site on transcription efficiency, 
we measured the functionality of shRNAs, which decreased 
significantly if the initiation site was at the +3 position, as is 
the case for shR-nccr5-TTT, -TTTA, or -TTC, and the func- 
tionality was reduced further if the initiation site was at the 
+4 position (shR-nccr5-TTTT; Figure 2b). The results appear 
to be consistent with small RNA cloning data, suggesting 



Figure 1 Transcription from the U6 promoter does not necessarily start from the +1 (G) position, (a) sh1005 was transfected into 293FT 
cells, and small RNAs were analyzed by deep sequencing. Representative reads of sequenced small RNAs are shown (all small RNAs 
sequenced for sh1005 can be found in Supplementary Tables S2 and S3). "X" indicates the positions at which these reads might end. The 
putative initiation site of transcription is boxed, and the real initiation site is marked in red. The Dicer cleavage site of the presumed mature 
sequences (NCCR5) and the real mature sequences (WCCR5) are indicated by arrows, (b) CCR5 knockdown efficiency of shRNAs in TZM-bl 
cells. TZM-bl cells were transfected with shRNAs and analyzed for CCR5 expression by flow cytometry 72 hours later. The bar graph on the 
right represents averages (±SD) of triplicates. MFI, mean fluorescence intensity, (c) The functionality of sh1005 and its miRNA-based shRNA 
counterpart was tested with the dual-luciferase assay, which was performed 24 hours after cotransfection of shRNAs with a pCHECK2 vector 
containing a 2x repeat of a sequence in the 3' UTR that is fully complementary to the target sequence. The ratio of Renilla luciferase (Rluc, 
reporter) to firefly luciferase (Flue, internal control) normalized to the negative control (mock vector) is shown. Error bar, 1 SD (d) Comparison 
of the initiation site of the natural mouse U6 small RNA with that of sh1005. (e) The transcription initiation site for seven randomly selected 
shRNAs was determined by analyzing the small RNAs after transfection. The initiation site is determined according to the 5' end of the 
predominant sequenced small RNAs generated from 5' arm of shRNA. All sequenced small RNAs can be found in Supplementary Tables 
SI and S2. UTR, untranslated region. 
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Figure 2 Analysis of the transcription initiation site driven by the U6 promoter, (a) The transcription initiation site for sh1 005 serial mutations. 
The predominant sequenced small RNAs can be found in Table 1 . Lowercase letters indicate the nucleotides replacing the A at the -1 position 
of sh1005.The presumed initiation site of transcription is boxed, and the real initiation site is marked red. The original sh1005 is marked bold, 
(b) The functionality of sh1005 serial mutations. The dual-luciferase assay was performed as in Figure 1c. (c) The transcription initiation 
site for small RNAs (-21 nt in length) in which the nucleotides around the initiation site are different. Lowercase letters indicate nucleotides 
inserted between the -2 position of the U6 promoter and the universal sequence GATAATTTGTGGTAGTGGTT. Asterisks indicates constructs 
for which an insufficient number of reads were sequenced to determine the exact initiation site. The predominant small RNAs can be found in 
Supplementary Figure S2b. (d) The sequence immediately upstream of the -1 site affects transcription initiation. The initiation sites of the 
indicated sh1005 mutations with altered sequences immediately upstream of the -1 site are marked red. The predominant sequenced small 
RNAs can be found in Table 2. 
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Table 1 Representative small RNA reads cloned from sh1005 serial mutations in Figure 2a 

#Sh1005-T UGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

. GAGCAAGCUCAGUUUACXXXXXXXXX 

. . . GCAAGCUCAGUUUACAXXXXXXXX 

#Sh1005-G GGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

GGAGCAAGCUCAGUUUXXXXXXXXXXX 

. GAGCAAGCUCAGUUUACAXXXXXXXXX 

#Sh1005-C CGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

CGAGCAAGCUCAGUUUAXXXXXX 

. GAGCAAGCUCAGUUUAXXXXXXXXXXX 

#Sh1005-TT UUGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

. . GAGCAAGCUCAGUUUAXXXXXXXXXXX 

#Sh1005-TTT UUUGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

. . UGAGCAAGCUCAGUUUACACXX 

. . .GAGCAAGCUCAGUUUAXXXXXXXXXXX 

#sh1005-TTTT uuuugagcaagcucaguuuacaccuuguccgacgguguaaacugagcuugcucuuuuu 

.... GAGCAAGCUCAGUUUACXXXXXXXX 

#Sh1005-TA UAGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

. AGAGCAAGCUCAGUUUXXXXXXXXXXXX 

#Sh1005-TTA UUAGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

. . AGAGCAAGCUCAGUUUXXXXXXXXXXX 

#sh1005-TTTA uuuagagcaagcucaguuuacaccuuguccgacgguguaaacugagcuugcucuuuuu 

. . . AGAGCAAGCUCAGUUUXXXXXXXXXXXX 

#Sh1005-TC UCGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

UCGAGCAAGCUCAGUUUXXXXX 

. CGAGCAAGCUCAGUUUACXXXXXXX 

. . GAGCAAGCUCAGUUUAXXXXXXXXXXX 

#Sh1005-TTC UUCGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

. UCGAGCAAGCUCAGUUUACAXXX 

. . CGAGCAAGCUCAGUUUACAXXXXXX 

. . . GAGCAAGCUCAGUUUAXXXXXXXXXX 

#Sh1005-CT CUGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

CUGAGCAAGCUCAGUUUACXXX 

. UGAGCAAGCUCAGUUUACXXXXXXXXX 

. . GAGCAAGCUCAGUUUAXXXXXXXXXXXX 

#Sh1005-TG UGGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

UGGAGCAAGCUCAGUUUACXXX 

. GGAGCAAGCUCAGUUUXXXXXXXXXXXX 

#Sh1005-CA CAGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

. AGAGCAAGCUCAGUUUXXXXXXXXXXXX 

#Sh1005-CG CGGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

CGGAGCAAGCUCAGUUUXXXXXX 

. GGAGCAAGCUCAGUUUXXXXXXXXXXXX 

#Sh1005-CC CCGAGCAAGCUCAGUUUACACCUUGUCCGACGGUGUAAACUGAGCUUGCUCUUUUU 

CCGAGCAAGCUCAGUUXXXXXXX 

. CGAGCAAGCUCAGUUUACXXXXXXXXX 

. . GAGCAAGCUCAGUUUAXXXXXXXXXXX 

^Only reads with abundances of greater than 5% are shown. 
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13.14 


2,418 


60.86 



that transcription efficiency appears to decrease if initiation 
begins at the +3 position or further downstream. 

In previous experiments, we relied on mature shRNA 
products to determine transcription initiation sites. Load- 
ing of the mature shRNA sequence into the RNA-induced 
silencing complex might introduce bias because of the strong 
sequence preference of the process. Thus, we designed 
a series of constructs expressing a small RNA (GAUAA 
UUUGUGGUAGUGGUU) with different sequences around 



position +1. The small RNA is -21 nt in length, so we could 
sequence it with our regular small RNA sequencing method. 
The constructs were transfected into 293FT cells separately, 
and the small RNAs were sequenced. The results were con- 
sistent with the experiments done with sh1 005. Transcription 
did not start until there was an A or G, and the initiating A or 
G had to be within the range of positions -1 to +2 for efficient 
transcription (Figure 2c and Supplementary Figure S2). 
We were not able to sequence many reads for any of the 
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Table 2 Representative small RNA reads cloned from sh1005 serial mutations in Figure 2d 

#shR-nccr5-GG uugugggagcaagcucaguuuacaccuuguccgacgguguaaacugagcuugcucuuuuu Reads % 

. . . . GGGAGCAAGCUCAGUUXXXXXXXXXXXX 15,271 52.91 

GGAGCAAGCUCAGUUUXXXXXXXXXXXX 4,458 15.45 

GAGCAAGCUCAGUUUAXXXXXXXXXXX 5,548 19.22 

AGCAAGCUCAGUUUACXXXXXXXXXXXX 2,705 9.37 

#shR-nccr5-GGG uugggggagcaagcucaguuuacaccuuguccgacgguguaaacugagcuugcucuuuuu Reads % 

....GGGAGCAAGCUCAGUUXXXXXXXXXXXX 8,806 25.77 

GGAGCAAGCUCAGUUUXXXXXXXXXXXX 6, 1 09 1 7.88 

GAGCAAGCUCAGUUUAXXXXXXXXXXXX 14,926 43.68 

AGCAAGCUCAGUUUACXXXXXXXXXXXX 2,821 8.26 

#shR-nccr5 dT uuguugagcaagcucaguuuacaccuuguccgacgguguaaacugagcuugcucuuuuu Reads % 

GAGCAAGCUCAGUUUAXXXXXXXXXXXX 24,508 88.35 

AGCAAGCUCAGUUUACXXXXXXXXXXXX 1,779 6.41 

#shR-nccr5 dTT uugugagcaagcucaguuuacaccuuguccgacgguguaaacugagcuugcucuuuuu Reads % 

....GAGCAAGCUCAGUUUAXXXXXXXXXXX 8,337 45.24 

AGCAAGCUCAGUUUACXXXXXXXXXXXX 5,432 29.47 

GCAAGCUCAGUUUACAXXXXXXXXXXXX 3,679 19.96 

#shR-nccr5 dTTT uuggagcaagcucaguuuacaccuuguccgacgguguaaacugagcuugcucuuuuu Reads % 

. . . GAGCAAGCUCAGUUUAXXXXXXXXXXX 1 , 1 37 9.13 

....AGCAAGCUCAGUUUACXXXXXXXXXXXX 3,116 25.01 

GCAAGCUCAGUUUACAXXXXXXXXXXXX 5,278 42.37 

AAGCUCAGUUUACACCXXXXXXXXXXXX 2,727 21.89 



constructs in which A or G was at the +3 or +4 positions, 
suggesting that the number of transcripts decreased if the A 
or G was at, or downstream of, position +3 (Figure 2c and 
Supplementary Figure S2). 

Thus, sequence changes around the putative initiation 
site, ranging from positions -1 to +4, can affect transcription 
initiation. To understand how the sequence upstream of the 
-1 site affects transcription initiation, the T at positions -2 
and -3 was changed to G by point mutation (shR1005-GG 
and shR-1 005-GGG; Figure 2d). Compared with shR1 005-G 
(Figure 2a), the transcription initiation of shR1005-GG and 
shR1 005-GGG started at multiple sites (Figure 2d). We fur- 
ther deleted the Ts at positions -1, -2, and -3 (Figure 2d). 
Compared with sh1005-T (Figure 2a) and sh1005 dT, in 
which the transcription initiation is precise, the transcrip- 
tion of sh1005 dTT and sh1005 dTTT started at multiple 
sites (Figure 2d and Table 2). These data suggest that the 
sequence immediately upstream of the -1 site can affect ini- 
tiation, and two continuous Ts immediately upstream of the 
-1 site are required for accurate initiation of mouse U6-driven 
small RNA expression. Thus, the sequence immediately 
upstream of the -1 site affects transcription initiation by the 
mouse U6 promoter. 

In summary, we identified a new region in the mouse U6 
promoter — the sequence around the putative initiation site 
ranging from position -3 to +4 — that affects the precision and 
efficiency of transcription initiation. Our results suggest the 
following guidelines for using the mouse U6 promoter to gen- 
erate the desired small RNA sequence: the initiation nucleo- 
tide can be either A or G, and it should be within the range of 
positions -1 to +2. It is also critical to maintain the sequence 
immediately upstream of the initiation site, especially a con- 
tinuous sequence of Ts, and avoid inserting A, G, or even C 
upstream of the desired initiation site. 



Applying the new guidelines to improve the design of 
shRNA and gRNA 

The starting nucleotide for natural miRNAs or potent siRNAs 
is often U or A, which is not easy to convert into shRNAs 
driven by the U6 promoter, at least according to the previously 
held concept. With our findings on the mouse U6 promoter 
initiation site, it should be easy to design shRNA or miRNA 
mimics starting with nucleotide A. To test this, we designed 
two shRNAs mimicking miR-451 and miR-31, both starting 
with nucleotide A, and both were highly potent in repressing 
targets (Figure 3a). To confirm that the miRNA mimics were 
processed as predicted, both constructs were transfected 
into 293FT cells, and the small RNAs were sequenced. The 
mature products were exactly the same as for the miRNAs 
(Figure 3a). Thus, improved knowledge of the U6 promoter 
enables the design of shRNA or miRNA mimics starting with 
nucleotide A, and the desired sequence can be generated 
as designed. 

Because it can knock out any gene with ease, the recently 
developed genome-editing technology, based on the 
CRISPR-Cas system, promises to revolutionize the biomedi- 
cal research field as much or more than RNAi. However, the 
target-site selection is limited by the requirements that NGG 
should be the sequence downstream of the target site and 
that the initiation site must be G if the U6 promoter is used 
to transcribe the gRNA. Now, with our new finding about U6 
initiation site selection, we can choose target sites starting 
with either A or G, which doubles the number of possible tar- 
get sites. Four gRNAs targeting the CCR5 gene were cho- 
sen, two of them initiated with A and two initiated with G, to 
test gene-knockout efficiency The gRNA constructs targeting 
CCR5 were cotransfected with a Cas9-expressing plasmid 
into TZM-bl cells. Knockout efficiency was determined by 
flow cytometry 72 hours later, and all gRNAs were found to 
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#shR-miR-451 

uu SvCCGUUACCAUUACUGAGUUuaguaaugguaaugGUUcuuuu-i Reads % 
.. ttvCCGUUACCAUUACXXXXXXXXXXX 34,443 84.94 
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#shR-miR-31 

IgGC a AG AUGCUGGC AU AiGCUGUUga acu gggaa ccUGCU AUGCC AAC AUAUUGCCftUu uuu Reads % 
PGCAAGAUGCUGGCAXXXXXXXXX . . . . . 2,621 94.28 
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Figure 3 Applying the new finding to the design of miRNA mimics and gRNAs starting with A. (a) The functionality of miR-451 and miR-31 
mimics (left) was tested as in Figure 1c. The predominant mature products generated from these two mimics are shown. All small RNAs 
sequenced can be found in Supplementary Tables S1 and S2. The mature miRNA sequences designated in miRBase are shown as 
uppercase letters, (b) The knockout efficiency of gRNAs starting with A or G, with zinc finger nuclease (ZFN) serving as the positive control. 
TZM-bl cells were cotransfected with gRNA constructs and Cas9-expressing plasmid and analyzed for CCR5 expression by flow cytometry 
72 hours later. The bar graph on the right represents the averages (±SD) of triplicates. gRNA, guide RNA. 



be functional (Figure 3b). It is noteworthy that the gRNA4 
construct with the highest functionality is actually initiated 
with A (Figure 3b and Supplementary Table S3). Thus, 
innproved knowledge of the U6 pronnoter doubles the nunnber 
of choices of target sites for CRISPR-Cas systenn-nnediated 
genonne-editing technology. 

Transcription of other pol III promoters 

Until this point, all experiments were based on the nnouse U6 
pronnoter. Next, we checked the transcription initiation site of 
two different pol III pronnoters. V45 pHIPPY PGL3 luciferase 
is a plasnnid using the hunnan U6 and H1 pronnoters to drive 



transcription of two snnall RNAs of -22 nt in length fronn oppo- 
site directions to generate an siRNA duplex targeting the 
luciferase gene.'^^ We sequenced the resulting snnall RNAs to 
deternnine the initiation sites for these pronnoters. As shown 
in Figure 4a, the putative initiation sites for both pronnoters 
were not correct, as the transcripts actually started fronn nnul- 
tiple sites. The reason for this variability nnight be that the 
sequences upstreann of the initiation sites in this construct 
were altered to acconnnnodate the restriction enzynne site and 
ternnination signals (TTTTT) for both pronnoters (Figure 4a). 
This possibility would be consistent with our previous conclu- 
sion that the sequence innnnediately upstreann of the initiation 
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Original 5' -TTGTGGAAAGGACGAAACACC}^ 
hU6 5' -TTGTG( 
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Figure 4 The transcription driven by the H1 promoter is generally variable, (a) Small RNAs generated from V45 pHIPPY PGL3-transfected 
cells. The putative siRNA sequences are underlined. The dotted line indicates the presumed initiation site, and bold letters represent the 
mutated sequence, (b) The initiation site of shRNAs driven by the human H1 promoter with an altered sequence upstream of the initiation site, 
(c) The initiation site of the shRNAs from Figure 4b with the original H1 promoter. shRNA, short hairpin RNA; siRNA, small interfering RNA. 



site contributes to the accuracy of initiation. To confirnn this 
hypothesis, we sequenced snnall RNAs generated fronn sev- 
eral shRNAs expressed by the hunnan U6 pronnoter (pLKO.1 



vector) in which the sequence upstreann of the initiation site 
was kept unaltered. As shown in Supplementary Figure S3a, 
the initiation site for these shRNAs was precisely where 
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predicted, suggesting that maintaining intact the sequence 
upstream of the initiation site is critical for the precision of 
initiation of the human U6 promoter, just like what we show 
for mouse U6 promoter (Figure 2). We also sequenced small 
RNAs generated from several shRNAs driven by the H1 pro- 
moter (pSUPER vector) in which the sequences upstream 
of the initiation sites were altered. As shown in Figure 4b, 
transcription started from multiple sites in all cases. However, 
after we corrected the mutated upstream sequence by point 
mutation, the transcription still started from multiple sites 
(Figure 4c). Thus, it appears that transcription driven by the 
H1 promoter is generally variable, which results in inaccuracy 
at the 5' end of the transcripts. It is noteworthy that the tran- 
scription initiation constantly started at multiple sites ranging 
from -3 to +1 position (Figure 4a-c). 



Discussion 

It has been commonly accepted that transcription from the 
U6 promoter starts at the +1 position (23 nt downstream of 
the TATA box), with G as the preferred initiation nucleotide. In 
this study, we systematically investigated transcription initia- 
tion of the mouse U6 promoter and identified a new region, 
the sequence around the putative initiation site ranging from 
position -3 to +4, which affects transcription initiation accu- 
racy and the efficiency of the U6 promoter. Transcription can 
start with A or G, ranging from the -1 to the +2 positions. This 
new knowledge provides general guidelines for using the U6 
promoter to express small RNAs such as siRNAs/shRNAs 
and gRNAs for GRISPR. 

A previous study showed that the transcription initiation 
of a yeast tRNA promoter prefers purine (both A and G)."^^ 
Our study also showed that the transcription initiation of 
U6 promoter prefers purine, suggesting that the preference 
for purine as the initiation nucleotide might be a conserved 
phenomenon across species. However, for some other pol 
III promoters, such as H1, the transcription can start with 
pyrimidine (Figure 4), suggesting that preference for purine 
as the initiation nucleotide might not be a universal rule for 
pol III promoters. 

Our study also showed that the sequences around the 
putative initiation site in the U6 promoter are important for 
precision of transcription initiation; however, the sequences 
are commonly altered to accommodate a restriction enzyme 
recognition sequence in order to simplify the cloning proce- 
dure, because their effect on transcription initiation was not 
known previously. In fact, when we checked the sequence 
upstream of the initiation site in commercially available vec- 
tors that use the U6 promoter to express small RNAs, we 
found that, in all cases, the sequence upstream of the ini- 
tiation site had been changed (see Supplementary Figure 
S3b). Such changes are likely to result in inaccuracy of the 5' 
end of transcripts. In addition, we showed that transcription 
initiation using another commonly used promoter, H1, which 
has been used in several commercially available vectors (see 
Supplementary Figure S4), is generally variable (Figure 4). 
Thus, inaccuracy of the 5' end of transcripts driven by the pol 
III promoter might be a common problem. In addition to yield- 
ing small RNA products that are not the desired sequences, 
thus affecting on-target functionality, such inaccuracy is also 



likely to increase off-target effects caused by undesired 
small RNA products. Thus, our study provides guidelines 
for using the pol III promoter to transcribe accurately, which 
might increase on-target functionality and minimize off-tar- 
get effects. Although the sequence upstream of the putative 
initiation site in commercially available vectors was modi- 
fied, the sequence in some freely available vectors, such as 
pLL3.7, pLB, and pLKO.1 , which can be easily obtained from 
Addgene, was not changed. These vectors can be used to 
express small RNAs precisely. 

It appears that the initiation constantly starts from multiple 
sites ranging from -3 to +1 position for H1 promoter (Figure 
4). The information might be used to increase the chance to 
design potent shRNAs. Currently, it is still impossible to pre- 
dict the potency of shRNA. By purposely using H1 promoter 
to generate shRNA transcripts with different 5' ends from a 
single construct, which will be processed by Dicer into dis- 
tinct siRNA duplexes because Dicer can make a cleavage 
by measuring the distance from the 5' ends,^^ the chances of 
getting a potent shRNA will be significantly increased since 
one nucleotide change might change the potency dramati- 
cally, although the off-targets might also be increased due to 
multiple mature siRNAs might be generated. 

It has been commonly accepted that the termination sig- 
nal for pol III promoter-driven transcription is a simple four to 
six-residue poly-T,"^^ which is currently used as the terminal 
signal for almost all small RNA expression cassettes driven 
by the pol III promoter. However, we found that poly-T might 
not be an efficient termination signal. miR-155, which has 
five continuous Ts in the middle of its pri-miRNA, can be effi- 
ciently expressed using the U6 promoter, as abundant mature 
miR-155 can be cloned at a level similar to miR-223, which 
does not have a poly-T signal in the middle of its pri-miRNA 
sequence (see Supplementary Figure S5). This finding sug- 
gests that the poly-T signal in miR-155 does not terminate 
transcription efficiently at this site and is consistent with a 
recent report in which Nielsen et al.^^ found that the poly-T 
signal does not cause termination by itself but rather causes 
a pause by Pol III, and termination requires an additional sig- 
nal from the RNA secondary structure. Thus, the currently 
accepted concept of pol III promoter-driven transcription ini- 
tiation and termination is incomplete, which often results in 
expression of small RNAs from these promoters that do not 
have the designed sequence. 

It is surprising that no systematic studies have been done 
to provide guidelines on how to use pol III promoters appro- 
priately, although these promoters have been commonly used 
to express siRNA and shRNA since 2002. In most cases, the 
pol III product sequence is assumed based on current con- 
cepts of Pol III transcription initiation. Very few studies have 
validated the exact sequence experimentally by sequencing. 
As our study shows, the exact mature product of sh1005, 
which has been approved for a clinical trial, is not the pre- 
sumed sequence, and transcription does not initiate from the 
presumed initiation site (Figure 1a). A recent report in which 
U6 promoter was used to express miR-30 also showed that 
the initiation site is not precise.^^ Thus, a significant number 
of previous studies those used pol III promoter to express 
small RNAs will need to be revisited, because the putative 
small RNA sequences expressed might actually be incorrect. 
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Materials and methods 

Plasmids. All the small RNA constructs were designed as 
oligos and inserted into pLL3.7 at restriction sites Hpa\ and 
Xho\. The inserted sequences are listed in Supplementary 
Table S3. V45 pHIPPY PGL3 luciferase, shRNA constructs 
driven by the human U6 promoter (pLKO.1), and the human 
H1 promoter (pSUPER vector) were obtained from Addgene. 

Dual luciferase assay. 293FT cells (Invitrogen, Carlsbad, 
CA) were cultured according to the manufactur- 
er's instructions. The day before transfection, 293FT 
cells were trypsinized and diluted to 10^ cells/ml and 
seeded in 96-well plates in a volume of 100 |jl/well. 
shRNA construct (0.1 |jg) and psiCHECK2 harboring the tar- 
get sequence (0.1 |jg) were cotransfected into 293FT cells 
with lipofectamine 2000 per the manufacturer's instructions. 
The Dual-Glo luciferase assay (Promega, Madison, Ml) was 
performed 24 hours later. 

Small RNA sequencing. Small RNA libraries were constructed 
and sequenced in a similar manner as described previously^^"^^ 
but with major modifications to improve adaptor ligation effi- 
ciency and thereby reduce ligation bias. The same method has 
been used in our recent study.^^ A manuscript describing the 
detailed methods is in preparation. Briefly, 28 hours after the 
constructs were transfected into 293FT cells, the small RNAs 
were purified with the miRNeasy kit (Qiagen, Valencia, CA) per 
the manufacturer's instructions. Small RNA (50 ng) was ligated 
with 3' and 5' linkers (with barcode) using an improved ligation 
method that was optimized comprehensively to minimize the 
ligation bias between different small RNAs. The ligated small 
RNAs were reverse transcribed and amplified with the KARA 
library amplification kit (KARA Biosystems, Woburn, MA) for 10 
cycles, and the library sequenced using the lllumina HiSeq2000, 
San Diego, CA. All reads that were sequenced only once were 
discarded to lower the noise level. The sequenced small RNAs 
are included in Supplementary Table S1. 

Flow cytometry analysis of CCR5 expression. Seventy-two 
hours after transfection of the constructs into TZM-bl cells 
(with lipofectamine 2000 per the manufacturer's instructions), 
the cells were detached and stained with CCR5 antibody con- 
jugated with ARC (BD Pharmingen, San Jose, CA) and sub- 
jected to flow cytometry analysis. The pLL3.7 vector used to 
express small RNAs has a green fluorescent protein marker, 
which can serve as a marker for cells that are transfected; 
thus, green fluorescent protein-positive cells were gated by 
their expression of CCR5 and analyzed. 

Statistical analysis. Student's f-test (two-tailed, assuming 
equal variances on all experimental data sets) was used to 
compare two groups of independent samples. 

Supplementary materials 

Figrue S1. The processing of miRNA-based shRNAs. 
Figure S2. The small RNAs generated from the constructs 
in Figure 2c. 

Figure S3. Human U6 promoter. 



Figure S4. Commercially available vectors using the HI 
promoter. 

Figure S5. The mature miRNAs generated from the indi- 
cated pri-miRNAs. 

Table S1. Small RNAs generated from various constructs. 
Table S2. Simplified data from Supplementary Table S1. 
Table S3. The oligo sequences that were inserted into 
pLL3.7 vectors. 

Acknowledgments. The authors thank Jessica Montoya for 
technical support and Dianne Mitchell and the staff of the 
genomic core facility at Texas Tech University Health Science 
Center for their assistance with DNA sequencing. 

The authors declare that they have no conflict of interest. 

1 . Brummelkamp, TR, Bernards, R and Agami, R (2002). A system for stable expression of 
short interfering RNAs in mammalian cells. Science 2%: 550-553. 

2. Brummelkamp, TR, Bernards, R and Agami, R (2002). Stable suppression of 
tumorigenicity by virus-mediated RNA interference. Cancer Cell 2: 243-247. 

3. Lee, NS, Dohjima, T, Bauer, G, Li, H, Li, MJ, Ehsani, A etal. (2002). Expression of small 
interfering RNAs targeted against HIV-1 rev transcripts in human cells. Nat Biotechnol 20: 
500-505. 

4. Miyagishi, M and Taira, K (2002). U6 promoter-driven siRNAs with four uridine 3' overhangs 
efficiently suppress targeted gene expression in mammalian cells. Nat Biotechnol 20: 
497-500. 

5. Paddison, PJ, Gaudy AA, Bernstein, E, Hannon, GJ and Gonklin, DS (2002). Short hairpin 
RNAs (shRNAs) induce sequence-specific silencing in mammalian cells. Genes Dei/ 16: 
948-958. 

6. Paul, GR Good, PD, Winer, I and Engeike, DR (2002). Effective expression of small 
interfering RNA in human cells. Nat Biotechnol 20: 505-508. 

7. Sui, G, Soohoo, G, Affar el, B, Gay F, Shi, Y, Forrester, WG et al. (2002). A DNA vector- 
based RNAi technology to suppress gene expression in mammalian cells. Proc Natl Acad 
SclUSA99: 5515-5520. 

8. Yu, JY DeRuiter, SL and Turner, DL (2002). RNA interference by expression of short- 
interfering RNAs and hairpin RNAs in mammalian cells. Proc Natl Acad Scl USA 99: 
6047-6052. 

9. Siolas, D, Lerner, G, Burchard, J, Ge, W, Linsley PS, Paddison, PJ et al. (2005). Synthetic 
ShRNAs as potent RNAi triggers. Nat Biotechnol 23: 227-231 . 

10. Jinek, M, Ghylinski, K, Fonfara, I, Hauer, M, Doudna, JA and Gharpentier, E (2012). A 
programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity 
Sc/ence 337: 81 6-821. 

1 1 . Gasiunas, G, Barrangou, R, Horvath, P and Siksnys, V (2012). Gas9-crRNA 
ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in 
bacteria. Proc Natl Acad Scl USA 109: E2579-E2586. 

12. Qi, LS, Larson, MH, Gilbert, LA, Doudna, JA, Weissman, JS, Arkin, AP et al. (201 3). 
Repurposing GRISPR as an RNA-guided platform for sequence-specific control of gene 
expression. Ce// 152: 1 1 73-1 1 83. 

13. Mali, R Yang, L, Esvelt, KM, Aach, J, Guell, M, DiGarlo, JE etal. (2013). RNA-guided 
human genome engineering via Gas9. Science 339: 823-826. 

14. Jinek, M, East, A, Gheng, A, Lin, S, Ma, E and Doudna, J (2013). RNA-programmed 
genome editing in human cells. Elite 2: e00471. 

15. Jiang, W, Bikard, D, Gox, D, Zhang, F and Marraffini, LA (2013). RNA-guided editing of 
bacterial genomes using GRISPR-Gas systems. Nat Biotechnol 3^: 233-239. 

1 6. Hwang, WY Fu, Y Reyon, D, Maeder, ML, Tsai, SQ, Sander, JD et al. (201 3). Efficient 
genome editing in zebrafish using a GRISPR-Gas system. Nat Biotechnol 3^: 227-229. 

1 7. DiGarlo, JE, Norville, JE, Mali, R Rios, X, Aach, J and Ghurch, GM (201 3). Genome 
engineering in Saccharomyces cerevisiae using GRISPR-Gas systems. Nucleic Acids Res 
41:4336-4343. 

18. Gong, L, Ran, FA, Gox, D, Lin, S, Barretto, R, Habib, N etal. (2013). Multiplex genome 
engineering using GRISPR/Gas systems. Science 339: 819-823. 

1 9. Gho, SW, Kim, S, Kim, JM and Kim, JS (201 3). Targeted genome engineering in human 
cells with the Gas9 RNA-guided endonuclease. Nat Biotechnol 3V. 230-232. 

20. Ghang, N, Sun, G, Gao, L, Zhu, D, Xu, X, Zhu, X et al. (201 3). Genome editing with RNA- 
guided Gas9 nuclease in zebrafish embryos. Cell Res 23: 465^72. 

21 . Macrae, IJ, Zhou, K, Li, F, Repic, A, Brooks, AN, Gande, WZ et al. (2006). Structural basis 
for double-stranded RNA processing by Dicer. Sc/ence 311: 195-198. 

22. Park, JE, Heo, I, Tian, Y Simanshu, DK, Ghang, H, Jee, D et al. (201 1 ). Dicer recognizes 
the 5' end of RNA for efficient and accurate processing. Nature 475: 201-205. 

23. Kunkel, GR, Maser, RL, Galvet, JP and Pederson, T (1 986). U6 small nuclear RNA is 
transcribed by RNA polymerase III. Proc Natl Acad Scl USA 83: 8575-8579. 

24. Goomer, RS and Kunkel, GR (1 992). The transcriptional start site for a human U6 small 
nuclear RNA gene is dictated by a compound promoter element consisting of the PSE and 
the TATA box. Nucleic Acids Res 20: 4903^91 2. 



Molecular Therapy — Nucleic Acids 



Small RNA Expression With Pol III Promoters 

Ma etal 



11 



25. Mussolino, C and Cathomen, T (201 3). RNA guides genome engineering. Nat Biotechnol 
31:208-209. 

26. An, DS, Donaiiue, RE, Kamata, M, Poon, B, Metzger, M, Mao, SH etal. (2007). Stable 
reduction of CCR5 by RNAi tlirougli liematopoietic stem cell transplant in non-human 
primates. Proc Natl Acad Sci USA 1 04: 1 31 1 0-1 31 1 5. 

27. Shimizu, S, Kamata, M, Kittipongdaja, R Chen, KN, Kim, S, Pang, S etal (2009). 
Characterization of a potent non-cytotoxic shRNA directed to the HIV-1 co-receptor CCR5. 
Genet Vaccines Ther7: 8. 

28. Kamata, M, Liu, S, Liang, M, Nagaoka, Y and Chen, IS (2010). Generation of human 
induced pluripotent stem cells bearing an anti-HIV transgene by a lentiviral vector carrying 
an internal murine leukemia virus promoter. Hum Gene Ther2^■. 1555-1567. 

29. Liang, M, Kamata, M, Chen, KN, Pariente, N, An, DS and Chen, IS (2010). Inhibition 
of HIV-1 infection by a unique short hairpin RNA to chemokine receptor 5 delivered 
into macrophages through hematopoietic progenitor cell transduction. J Gene Med 12: 
255-265. 

30. Shimizu, S, Hong, R Arumugam, B, Pokomo, L, Boyer, J, Koizumi, N etai (2010). A 
highly efficient short hairpin RNA potently down-regulates CCR5 expression in systemic 
lymphoid organs in the hu-BLT mouse model. S/ood 11 5: 1534-1544. 

31 . Symonds, GR Johnstone, HA, Millington, ML, Boyd, MR Burke, BP and Breton, LR (2010). 
The use of cell-delivered gene therapy for the treatment of HIV/AIDS. Immunol Res 48: 
84-98. 

32. Ringpis, GE, Shimizu, S, Arokium, H, Camba-Colon, J, Carroll, MV, Cortado, R et 
al. (2012). Engineering HIV-1 -resistant T-cells from short-hairpin RNA-expressing 
hematopoietic stem/progenitor cells in humanized BLT mice. PLoS One 7: e53492. 

33. An, DS, Qin, FX, Auyeung, VC, Mao, SH, Kung, SK, Baltimore, D etal (2006). 
Optimization and functional effects of stable short hairpin RNA expression in primary 
human lymphocytes via lentiviral vectors. Mol Ther^^■. 494-504. 

34. Silva, JM, Li, MZ, Chang, K, Ge, W, Golding, MC, Rickles, RJ et al (2005). Second- 
generation shRNA libraries covering the mouse and human genomes. Nat Genet 37: 
1281-1288. 

35. McBride, JL, Boudreau, RL, Harper, SQ, Staber, PD, Monteys, AM, Martins, I etal (2008). 
Artificial mlRNAs mitigate shRNA-mediated toxicity in the brain: implications for the 
therapeutic development of RNAi. Proc Natl Acad Scl USA 105: 5868-5873. 

36. Ma, H, Wu, Y, Choi, JG and Wu, H (201 3). Lower and upper stem-single-stranded RNA 
junctions together determine the Drosha cleavage site. Proc Natl Acad Scl USA 110: 
20687-20692. 

37. Newman, MA, Mani, V and Hammond, SM (201 1 ). Deep sequencing of microRNA 
precursors reveals extensive 3' end modification. RNA 17: 1795-1803. 



38. Ameres, SL, Horwich, MD, Hung, JH, Xu, J, Ghildiyal, M, Weng, Z etal (2010). Target 
RNA-directed trimming and tailing of small silencing RNAs. Science 32B: 1534-1539. 

39. Wu, H, Ye, C, Ramirez, D and Manjunath, N (2009). Alternative processing of primary 
microRNA transcripts by Drosha generates 5' end variation of mature microRNA. PLoS 
One A: e7566. 

40. Katoh, T Sakaguchi, Y Miyauchi, K, Suzuki, T Kashiwabara, S, Baba, T ef al (2009). 
Selective stabilization of mammalian microRNAs by 3' adenylation mediated by the 
cytoplasmic poly(A) polymerase GLD-2. Genes Dei/ 23: 433^38. 

41 . Gu, S, Jin, L, Zhang, Y Huang, Y Zhang, F, Valdmanis, PN et al (201 2). The loop position 
of shRNAs and pre-miRNAs is critical for the accuracy of dicer processing in vivo. Ce// 151 : 
900-911. 

42. Kaykas, A and Moon, RT (2004). A plasmid-based system for expressing small interfering 
RNA libraries in mammalian cells. BMC Cell Biol 5: 16. 

43. Zecherle, GN, Whelen, S and Hall, BD (1 996). Purines are required at the 5' ends of 
newly initiated RNAs for optimal RNA polymerase III gene expression. Mol Cell 6/0/ 16: 
5801-5810. 

44. Bogenhagen, DF, Sakonju, S and Brown, DD (1 980). A control region in the center of the 
5S RNA gene directs specific initiation of transcription: II. The 3' border of the region. Cell 
19:27-35. 

45. Nielsen, S, Yuzenkova, Y and Zenkin, N (201 3). Mechanism of eukaryotic RNA polymerase 
III transcription termination. Sc/ence 340: 1577-1580. 

46. Neilson, JR, Zheng, GX, Burge, CB and Sharp, PA (2007). Dynamic regulation of mlRNA 
expression in ordered stages of cellular development. Genes Deu21: 578-589. 

47. Wu, H, Neilson, JR, Kumar, R Manocha, M, Shankar, R Sharp, PA etal (2007). mlRNA 
profiling of naive, effector and memory CD8 T cells. PLoS One 2: el 020. 




This work is licensed, under a Creative Commons 
Attribution-NonCommercial-ShareAlike 3.0 Unported 



License. The images or other third party material in this article are 
included in the article's Creative Commons license, unless indicated 
otherwise in the credit line; if the material is not included under the 
Creative Commons license, users will need to obtain permission from 
the license holder to reproduce the material. To view a copy of this 
license, visit http://creativecommons.0rg/licenses/by-nc-sa/3.O/ 



Supplementary Information accompanies this paper on the Molecular Therapy-Nucleic Acids website (http://www.nature.com/mtna) 



www.moleculartherapy.org/ mtna 



